AITopics | random factor

Collaborating Authors

random factor

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Instance-level Randomization: Toward More Stable LLM Evaluations

Li, Yiyang, Wu, Yonghuang, Luo, Ying, Sun, Liangtai, Qin, Zishu, Qiu, Lin, Cao, Xuezhi, Cai, Xunliang

arXiv.org Artificial IntelligenceSep-17-2025

Evaluations of large language models (LLMs) suffer from instability, where small changes of random factors such as few-shot examples can lead to drastic fluctuations of scores and even model rankings. Moreover, different LLMs can have different preferences for a certain setting of random factors. As a result, using a fixed setting of random factors, which is often adopted as the paradigm of current evaluations, can lead to potential unfair comparisons between LLMs. To mitigate the volatility of evaluations, we first theoretically analyze the sources of variance induced by changes in random factors. Targeting these specific sources, we then propose the instance-level randomization (ILR) method to reduce variance and enhance fairness in model comparisons. Instead of using a fixed setting across the whole benchmark in a single experiment, we randomize all factors that affect evaluation scores for every single instance, run multiple experiments and report the averaged score. Theoretical analyses and empirical results demonstrate that ILR can reduce the variance and unfair comparisons caused by random factors, as well as achieve similar robustness level with less than half computational cost compared with previous methods.

large language model, natural language, random factor, (17 more...)

arXiv.org Artificial Intelligence

2509.12678

Country:

North America > United States (0.68)
Europe (0.68)

Genre:

Research Report > New Finding (0.66)
Research Report > Experimental Study (0.46)

Industry: Leisure & Entertainment > Sports (0.46)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Sensitivity Analysis on Policy-Augmented Graphical Hybrid Models with Shapley Value Estimation

Zhao, Junkai, Xie, Wei, Luo, Jun

arXiv.org Machine LearningDec-1-2024

Driven by the critical challenges in biomanufacturing, including high complexity and high uncertainty, we propose a comprehensive and computationally efficient sensitivity analysis framework for general nonlinear policy-augmented knowledge graphical (pKG) hybrid models that characterize the risk- and science-based understandings of underlying stochastic decision process mechanisms. The criticality of each input (i.e., random factors, policy parameters, and model parameters) is measured by applying Shapley value (SV) sensitivity analysis to pKG (called SV-pKG), accounting for process causal interdependences. To quickly assess the SV for heavily instrumented bioprocesses, we approximate their dynamics with linear Gaussian pKG models and improve the SV estimation efficiency by utilizing the linear Gaussian properties. In addition, we propose an effective permutation sampling method with TFWW transformation and variance reduction techniques, namely the quasi-Monte Carlo and antithetic sampling methods, to further improve the sampling efficiency and estimation accuracy of SV for both general nonlinear and linear Gaussian pKG models. Our proposed framework can benefit efficient interpretation and support stable optimal process control in biomanufacturing.

artificial intelligence, machine learning, pkg model, (18 more...)

arXiv.org Machine Learning

2411.13396

Country:

Asia > China > Shanghai > Shanghai (0.04)
Europe > Italy > Marche > Ancona Province > Ancona (0.04)
North America > United States > Maryland > Montgomery County > Silver Spring (0.04)

Genre: Research Report > New Finding (0.45)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.93)
Health & Medicine > Public Health (0.67)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Modeling & Simulation (0.92)
Information Technology > Game Theory (0.84)
Information Technology > Data Science (0.67)
(2 more...)

Add feedback

ArtificialLife Newsletter

#artificialintelligenceOct-25-2019, 10:52:03 GMT

Time flies when you are busy. Last weeks I have been really busy while developing and testing some ideas for Planetrism. I have not posted any updates specifically on Planetrism because last years have taught, that there is nothing gained from constant updates. Not only do frequent updates take valuable development time, they do not add any value to development itself. Ultimately development should focus on developing the game itself (and get it ready), instead of developing updates constantly. Also, it seems that social media has changed a lot during past two years. Something that gained lots of attention 2 years ago (and gained some input that was valuable to development), gets only minimal views currently.

generator, probability, treasure, (15 more...)

#artificialintelligence

Industry: Leisure & Entertainment > Games > Computer Games (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.71)
Information Technology > Communications > Social Media (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.47)

Add feedback